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Abstract — The fc-core decomposition in a grapli is a fundamen- 
tal problem for social network analysis. The problem of fc-core 
decomposition is to calculate the core number for every node in 
a graph. Previous studies mainly focus on fc-core decomposition 
in a static graph. There exists a linear time algorithm for fc-core 
decomposition in a static graph. However, in many real-world 
applications such as online social networks and the Internet, the 
graph typically evolves over time. Under such applications, a key 
issue is to maintain the core number of nodes given the graph 
changes over time. A simple implementation is to perform the 
linear time algorithm to recompute the core number for every 
node after the graph is updated. Such simple implementation 
is expensive when the graph is very large. In this paper, we 
propose a new efficient algorithm to maintain the core number 
for every node in a dynamic graph. Our main result is that 
only certain nodes need to update their core number given the 
graph is changed by inserting/deleting an edge. We devise an 
efficient algorithm to identify and recompute the core number of 
such nodes. The complexity of our algorithm is independent of 
the graph size. In addition, to further accelerate the algorithm, 
we develop two pruning strategies by exploiting the lower and 
upper bounds of the core number. Finally, we conduct extensive 
experiments over both real-world and synthetic datasets, and the 
results demonstrate the efficiency of the proposed algorithm. 

I. Introduction 

In the last decade, online social network analysis has 
become an important topic in both research and industry 
communities due to a larger number of applications. A crucial 
issue in social network analysis is to identify the cohesive 
subgroups of users in a network. The cohesive subgroup 
denotes a subset of users who are well-connected to one 
another in a network [16]. In the literature, there are a larger 
number of metrics for measuring the cohesiveness of a group 
of users in a social network. Examples include cliques, n- 
cliques, ?i-clans, fc-plexes, /c-core, /-groups, fc-trusses and so 
on [13]. 

For most of these metrics except fc-core, the computational 
complexity is typically NP-hard or at least quadratic, fc-core, 
as an exception, is a well-studied notion in graph theory and 
social network analysis [22]. Through-out the paper, we will 
interchangeably use graph and network. Given a graph G, the 
fc-core is the largest subgraph of G such that all the nodes in 
the fc-core have at least degree fc. For each node v in G, the 
core number of v denotes the largest fc-core that contains v. 
The fc-core decomposition in a graph G is to calculate the core 
number for every node in G. There is a linear time algorithm, 
devised by Batagelj and Zaversnik [8], to compute the fc-core 
decomposition in a graph G. 




Fig. 1. An example graph. 

Besides the analysis of cohesive subgroup, fc-core decom- 
position has been recognized as a powerful tool to analyze 
the structure and function of a network, and it has many 
applications. For example, the fc-core decomposition has been 
applied to visualize the large networks [7], [4], to map, model 
and analyze the topological structure of the Internet [10], [5], 
to predict the function of protein in protein-protein interaction 
network [17], [1], [23], to identify influential spreader in 
complex networks [18], as well as to study percolation on 
complex networks [15]. 

From the algorithmic perspective, efficient and scalable 
algorithms for fc-core decomposition in a static graph already 
exist [8], [12], [21]. However, in many real-world applications, 
such as online social network and the Internet, the network 
evolves over time. In such a dynamic network, a crucial 
issue is to maintain the core number for every node in a 
network provided the network changes over time. In a dynamic 
network, it is difficult to update the core number of nodes. The 
reason is as follows. An edge insertion/deletion results in the 
degree of two end-nodes of the edge increase/decrease by 1. 
This may lead to the updates of the core number of the end- 
nodes. Such updates of the core number of the end-nodes may 
affect the core number of the neighbors of the end-nodes which 
may need to be updated. In other words, the update of the core 
number of the end-nodes may spread across the network. For 
example, in Fig. [T] assume that we insert an edge {vs,vio) 
into the graph, resulting in the degree of ug and viq increase 
by 1. Suppose the core number of ug and viq increase by 1, 
then we can see that such core number update leads to the 
core number of uio's neighbors (wg, ui8,wii) that may need 
to be updated. And then the update of core number of vio's 
neighbors will result in the update of core number of wio's 
neighbors' neighbors. This update process may spread over 
the network. Therefore, it is hard to determine which node in 
a network should update its core number given the network 
changes. 



To update the core number for every node in a dynamic 
graph, in [20], Miorandi and Pellegrini propose to use the 
Unear algorithm given in [8] to recompute the core number for 
every node in a graph. Obviously, such an algorithm is expen- 
sive when the graph is very large. In this paper, we propose a 
efficient algorithm to maintain the core number for each node 
in a dynamic network. Our algorithm is based on the following 
key observation. We find that only a certain number of nodes 
need to update their core number when a graph is updated by 
inserting/deleting an edge. Reconsider the example in Fig. [1] 
After inserting an edge {vg,Vio), we can observe that only the 
core number of the nodes {vs, Vio, vig,, Vg, U2} updates, while 
the core number of the remaining nodes does not change. The 
key challenge is how to identify the nodes whose core numbers 
need to be updated. To tackle this problem, we propose a 
three-stage algorithm to update the core number of the nodes. 
First, we prove that only the core number of the nodes that 
are reachable from the end-nodes of the inserted/deleted edge 
and their core numbers equal to the minimal core number of 
the end-nodes may need to be updated. Based on this, we 
propose a coloring algorithm to find such nodes whose core 
numbers may need to be updated. Second, from the nodes 
found by the coloring algorithm, we propose a recoloring 
algorithm to identify the nodes whose core numbers definitely 
need to be updated. Third, we update the core number of 
such nodes by a linear algorithm. The major advantage of 
our algorithm is that its time complexity is independent of the 
graph size, and it depends on the size of the nodes found by 
the coloring algorithm. To further accelerate our algorithm, 
we develop two pruning techniques to reduce the size of the 
nodes found by the coloring algorithm. In addition, it is worth 
mentioning that our proposed algorithm can also be used to 
handle a batch of edge insertions and deletions by processing 
the edges one by one. Also, the proposed technique can be 
applied to process node insertions and deletions, because node 
insertions and node deletions can be simulated by a sequence 
of edge insertions and edge deletions respectively. Finally, we 
extensively evaluate our algorithm over 15 real- world datasets 
and 5 large synthetic datasets, and the results demonstrate 
the efficiency of our algorithm. More specifically, in real- 
world datasets, our algorithm reduces the average update time 
over the baseline algorithm from 3.2 times to 101.8 times 
for handling a single edge update. For handling a batch of 
edge updates, our algorithm needs to process the edge updates 
one by one, while the baseline algorithm only needs to run 
once for all edge updates. In the largest synthetic dataset (5 
million nodes and 25 million edges), the results show that our 
algorithm is still more efficient than the baseline algorithm 
when the number of edge updates is smaller than 4700. 

The rest of this paper is organized as follows. We give 
the problem statement in Section We propose our basic 
algorithm as well as the pruning strategies in Section |III] 
Extensive experimental studies are reported in Section HV] and 
the related work is discussed in Section |V] We conclude this 
work in Section [Vl] 



II. Preliminaries 

Consider an undirected and unweighted graph G = {V, E), 
where V denotes a set of nodes and E denotes a set of 
undirected edges between the nodes. Let n = and m = \E\ 
be the number of nodes and the number of edges in G, 
respectively. A graph G' = {V',E') is a subgraph of G if 
V' C V and E' C E. We give the definition of the /c-core 
[22] as follows. 

Definition 2.1: A A:-core is the largest subgraph G' of G such 
that each node in G' has at least a degree k. 

The core number of node v is defined as the largest /c-core 
that contains this node. We denote the core number of node 
V as Cy. It is worth noting that the nodes with a large core 
number are also in the low order core. That is to say, the cores 
are nested. For example, assuming a node ?j is in a 3-core, then 
node V is also in 2-core, 1-core and 0-core. 

Given a graph G, the problem of fc-core decomposition is to 
determine the core number for every node in G. The following 
example illustrates the concept of fc-core composition in graph. 

Example 2.1: Fig. [T] shows a graph G that contains 18 
nodes, i.e., vi, - ■ ■ ,vis- By Definition 12.11 we can find that 
the nodes W3, • • • ,vj form a 4-core. The reason is because 
the induced subgraph by the nodes , • • • , vj is the largest 
subgraph in which the degrees of nodes are lager than or 
equal to 4. Similarly, the subgraph induced by the nodes 
7J3, • ■ • , Vf, 1)14, ■ • ■ , vir is a 3-core, and the whole graph G is 
a 2-core. Here we can find that the nodes W3 , ■ • ■ , W7 are also 
in the 3-core and 2-core. □ 

It is well known that the fc-core decomposition in a static 
graph can be calculated by a 0{n + m) algorithm [8]. In many 
applications such as online social networks, the graph evolves 
over time. In this paper, we consider the problem of updating 
the core number for every node in the graph given the graph 
changes over time. In this problem, we assume that the core 
numbers of all the nodes have been known before the graph 
is updated. The potential change in our problem is that either 
edge insertion or edge deletion may result in the core number 
of a number of nodes that needs to be updated. Previous 
solution for this problem [20] is to perform the 0{n + m) 
core decomposition algorithm to re-compute the core number 
for every node in the updated graph. Clearly, such algorithm is 
expensive when the graph is very large. In the following, we 
mainly focus on devising more efficient algorithm for fc-core 
decomposition in a graph given the graph is updated by an 
edge insertion or deletion. Our proposed algorithm can also 
be used for processing a batch of edge updates. Moreover, 
since node insertions and deletions can be easily simulated as 
a sequence of edge insertions and edge deletions respectively, 
our algorithm can also be applied to handle node insertions 
and node deletions. 

III. The proposed algorithm 

Let N{v) be the set of neighbor nodes of node v, Dy 
be the degree of node v, i.e., Dy = \N{v)\. Then, we give 
two important quantities associated with a node v as follows. 
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Specifically, we define Xy as the number of ij's neighbors 
whose core numbers are greater than or equal to Cy, and 
define as the number of u's neighbors whose core numbers 
are strictly greater than C^. Formally, for a node v, we have 
Xy ^ \{u : u € N{v),Cu > Cy}\ and Yy = \{u : u e 
N{v),Cu > Cv}\. In effect, by definition, Xy denotes the 
degree of node v in the Ct,-core. The following lemma shows 
that Cy is bounded by Yy and Xy. 

Lemma 3.1: For every node v of a graph G, we have Yy < 

Cy < Xy < Dy. 

Proof: We denote the subgraph G' = {V',E') as the C„- 
core. Obviously, node v is in G'. By Definition 12.11 in G", 
node V has at least Gy neighbors, and the core number of all 
the nodes in G' is at least Gy. In other words, the number 
of v's neighbors whose core numbers are larger than or equal 
to Gy is at least Cy. By definition, Xy denotes such number. 
Therefore, we have Cy < Xy. In addition, by definition, we 
clearly know that Xy < Dy. For K„ < Cy, we can prove it by 
contradiction. Suppose Yy > Cy, then node v has more than 
Gy neighbors whose core numbers are strictly greater than 
Gy . By Definition 12.11 the core number of node v should be 
at least Cy + 1, which is a contradiction. This completes the 
proof. □ 
In the following, we give an example to illustrate the 
concepts of Xy and Y^. 

Example 3.1: Consider the node wg in Fig.[T] By definition, the 
core number of node ug is 2, i.e., Cyg = 2, and the degree of vg 
equals to 3, i.e., Dyg=3. Node vg has three neighbors (v2,vt, 
and vio) whose core number is greater than or equal to 2, and 
has one neighbor (vj) whose core number is strictly greater 
than 2. Therefore, we have Xy^ = 3 and Yy^ = 1, which 
consists with Lemma [TTI Similar results can be observed from 
other nodes in Fig. [T] □ 

Below, we define the notion of induced core subgraph. 

Deflnition 3.1: Given a graph G = iV,E) and a node v, the 
induced core subgraph of node v, denoted as Gy = {Vy,Ey), 
is a connected subgraph which consists of node v. Moreover, 
the core number of all the nodes in Gy is equivalent to Cy. 

By Definition 13.11 the induced core subgraph of node v 
includes the nodes such that they are reachable from v and 
their core numbers equal to Cy. Based on Definition 13.11 we 
define the union of two induced core subgraphs. 

Definition 3.2: For two nodes u and v and their corresponding 
induced core subgraph Gu = {Vu,Eu) and Gy = {Vy,Ey), 
the union of Gy and Gy is defined as Guuv ~ {Vuuv, Eyuy), 
where Kud = K U K and Eyuv {{vi,Vj)\{vi,Vj) £ 

E,Vi (z VuUvtVJ G VnUv}- 

It is worth mentioning that the union of two induced core 
subgraphs may not be connected. The following example 
illustrates the definitions of induced core subgraph and union 
of two induced core subgraphs. 

Example 3.2: Consider the nodes vs and via in Fig. [T] By 
definition, the induced core subgraph of vs is a subgraph 
that only contains node us. That is to say, V^g = {vs} and 



Ey^ = 0. The induced core subgraph of node vio is a subgraph 
that includes nodes {vi, V2, wg, wio: ^n, wi2, t'la, wis}- In other 
words, {wi,W2,W9,wio,wii,vi2,wi3,wi8} and Ey^„ = 

{{V1,V2), {V2,vg), (wg, Viq), (viq, Vll), (viO, Wls), (wil, W12), 

(^'11, ^'13)}- The union of these two induced core sub- 
graphs is GygOvio = (KjgUDioi^usUuio)' where T^gUi'io — 
{wi,'(;2,W8,W9,wio,'yii,wi2,wi3,wi8} and Eyguy^„ = Ey^„. 
Fig. |2] illustrates the union of two induced core subgraphs 




Fig. 2. The union of two induced core subgraphs (G^gUuio)- 

Based on Definition 13.11 and 13.21 we give a fc-core update 
theorem. 

Theorem 3.1: (k-core update theorem) Given a graph G = 
(y, E) and two nodes u and v. 

• If Gu > Cy, then either insertion or deletion of an 
edge (u, v) in C, only the core number of nodes in the 
induced core subgraph of node v, i.e., Gy, may need to 
be updated. 

• IF Gu < Cy, then either insertion or deletion of an edge 
(u, v) in a graph G, only the core number of nodes in 
the induced core subgraph of node u, i.e., Gu, may need 
to be updated. 

• IF Gu = Cy, then either insertion or deletion of an edge 
(u, v) in a graph G, only the core number of nodes in 
the union of two induced core subgraphs Gy and Gy, 
i.e., Guuv, niay need to be updated. 

To prove Theorem 13.11 we first give some useful lemmas 
as follows. 

Lemma 3.2: Given a graph G = (V, E) and a node u. If the 
core number of node u's neighbors increases (decreases) by 
at most 1, then Gu increases (decreases) by at most 1. 
Proof: First, we prove the increase case by contradiction. 
Suppose that Cu increases by at least 2. This implies that there 
are at least C„ + 2 neighbors of node u whose core numbers 
are larger than or equal to C„+2. Since the core number of u's 
neighbors increases by at most 1, the number of m's neighbors 
whose core numbers are larger than or equal to C„ + 2 is at 
most Yy. By Lemma 13.11 we know that Yu < Cy That is 
to say, the number of u's neighbors whose core numbers are 
larger than or equal to C„ + 2 is bounded by Cy, which is a 
contradiction. 

Second, we prove the decrease case. If the core number of 
the neighbors of node u decreases by at most 1, then u has 
at least Xu neighbors whose core numbers are greater than or 
equal to Cy — 1- Since Xy > Cy > Gu — 1, the core number 
of node u is at least C„ — 1. Therefore, Cy decreases by at 
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most 1. This completes the proof. □ 

Lemma 3.3: If we insert (delete) an edge {u,v) in a graph 
G, the core number of any node in G increases (decreases) 
by at most 1. 

Proof: We focus on proving the edge insertion case, and 
similar arguments can be used to prove the edge deletion case. 
After inserting an edge {u,v), both and D^, increase by 
1. Recall that Xu {X^) denotes the degree of u (v) in the 
Cti-core (Cii-core), which is a subgraph of G. Therefore, Xu 
and Xv increase by at most 1. By definition, Cu (Cu) equals 
to the minimal degree of the nodes in the C„-core (C«-core). 
Since X^ (Xy) increases by at most 1, the minimal degree of 
the nodes in the C„-core (Cy-core) increases by at most 1. As 
a result, the core number of node u (v) increases by at most 
1. Such increase of C„ (C„) may lead to increasing the core 
number of the neighbors of node u (v). Consider the one-hop 
neighbors of node u (v). According to Lemma \32\ the core 
number of all the neighbors of node u (v) increases by at most 
I. By recursively applying Lemma [32] we can conclude that 
the core number of all the nodes that are reachable from u (v) 
increases by at most I. On the other hand, the core number 
of the nodes that cannot be reachable from u (v) does not 
change. Put it all together, for any node in G, its core number 
increases by at most 1. This completes the proof. □ 

Lemma 3.4: Given a graph G and two nodes u and v such 
that Gu — Cy. If we insert an edge {u, v) in G, then either 
Gu and Gy increase by 1 or C„ and Gy do not change. 
Proof: We prove it by contradiction. Without loss of general- 
ity, after inserting an edge {u, v), we assume that C„ increases 
by 1 while G^ does not change. Since Gu increases by 1, node 
u has at least + 1 neighbors whose core numbers are larger 
than or equal to C„ + 1. By Definition l2.1l before inserting an 
edge (w, v), u has at most Gu neighbors whose core numbers 
are larger than or equal to Gu + 1- Therefore, node u's core 
number must be C„ + 1, which is a contradiction. 

Lemma 3.5: Given a graph G and an edge (it, v). Suppose G 
is updated by inserting or deleting an edge [u, v). Then, for 
any node w in G, if the core number of w ( Gu,) needs to be 
changed, such change only affects the core number of nodes 
in Gu,- If Gu, does not change, then it does not affect the core 
number of the nodes in G. 

Proof: We focus on the edge insertion case, and similar proof 
can be used to prove the edge deletion case. Assume that C„, is 
changed after inserting an edge (m, v) into G. By Lemma |331 
Gu, increases by 1. We denote the updated G^ as Gu,, i.e., 
Cw = Gu, + 1. Obviously, the increase of Gu, does not affect 
the core number of the nodes that cannot be reachable from 
w. Also, we claim that the increase of Gu, does not affect the 
core numbers of the nodes that can be reachable from w and 
their core numbers are less than or greater than Gu,- First, we 
consider a node z that are reachable from w and Gz < Gu,- 
Recall that Gz equals to the minimal degree of the nodes in 
the Cz-core. By definition, w is also in the C^-core (cores are 
nested). The increase of Gu, clearly does not increase such 
minimal degree. Hence, the core number of node z is still Cr. 



Second, we consider a node z that is reachable from w and 
Gz > Gu,- The minimal degree of the nodes in Cz-core is Gz 
and Gz > Cw- Similarly, the increase of does not increase 
such minimal degree, thereby Gz will not be updated. Put it all 
together, the increase of Gw only affects the core number of 
those nodes that are reachable from w and their core numbers 
equal to Gu,, which are the nodes in G„,. By definition, if Gu, 
does not change, then it will not affect the core number of all 
the nodes in G. This completes the proof. □ 
Armed with the above lemmas, we prove the fc-core update 
theorem as follows. 

Proof of Theorem l3.lt For the insertion of an edge (u, v), we 
consider three different cases: (1) Gu > Gu, (2) Gu < Gu, and 
(3) Gu ~ Gu- For Gu > Gu, we know that node u is in a higher 
order core than node v- By Definition 12.11 adding a neighbor 
V with a small core number to a node u does not affect Gu- 
By Lemma 13.51 since Gu does not change, node u will not 
affect the core number of the nodes in G. Consequently, we 
only need to update the core number of the nodes that are 
affected by node v. By Lemma 13.51 if Gu changes, then only 
the core number of nodes in Gu may need to be updated. If 
Gu does not change, then no node's core number needs to be 
updated. This proves the case (1). Symmetrically, we can use 
the similar arguments to prove the case (2). For case (3), after 
inserting an edge [u, v), by Lemma 13.41 either Gu and G„ 
increase by 1 or Gu and Gu do not change. If Gu and Gu do 
not change, by Lemma 13.51 we conclude that no node's core 
number needs to be updated. If G„ and Gu increase by 1, by 
Lemma [331 the core number of the nodes in Gu and Gu may 
need to be updated. That is to say, the core number of the 
nodes in Gu\ju may need to be updated. 

Similarly, for the deletion of an edge [u, v), we also 
consider three different cases: (1) G„ > Gu, (2) Gu < Gu, 
and (3) G„ = Gu- The proof for the first two cases is very 
similar to the proof for the first two cases under edge insertion 
case, thereby we omit for brevity. For G„ = Gu, after deleting 
an edge {u,v), if G„ and Gu do not change, we conclude 
that no node's core number needs to be updated according to 
Lemma [331 If G„ changes, by Lemma [331 the core number of 
nodes in G„ may need to be updated. Likewise, if Gu changes, 
the core number of nodes in G„ may need to be updated. 
To summarize, after removing an edge {u,v), only the core 
number of the nodes in Guuv may need to be updated. This 
completes the proof. □ 

A. The basic algorithm 

In this subsection, we present a basic algorithm for core 
maintenance in a graph given the graph is updated by an edge 
insertion or an edge deletion. Below, we describe the detailed 
algorithms for edge insertion and deletion, respectively. 

Algorithm for edge insertion: Our main algorithm for edge 
insertion consists of three steps. After inserting an edge {u, v), 
by the fc-core update theorem, only the core number of nodes 
in the induced core subgraph [Gu or Gu or Guuv) may need 
to be updated. Therefore, the first step of our main algorithm 
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Algorithm 1 Insertion(G', u, v) 

Input: Graph G = {V, E) and an edge [u, v) 
Output: the updated core number of the nodes 

1: Initialize visited(w) <— for all node w £V; 

2: Initialize color(ui) -s— for all node w £V; 

3: Vc ^ 0; 

4: if Cn > Cu then 

5: c^C„; 

6: Color(G, V, c); 

7: RecolorInsert(G, c); 

8: UpdatelnsertCC c); 

9: else 

10: c ^ C„; 

11: Color(G, u, c); 

12: RecolorInsert(G, c); 

13: UpdateInsert(G, c); 



Algorithm 2 void ColorCG, u, c) 

1: visited(it) <— 1; 

2: if color(u) = then 

3: Vc ^ K U {tt}; 

4: color(it) = 1; 

5: for each node w G A'^(ii) do 

6: if visited(tu) — and Cw = c then 

7: Color(G, c); 



Algorithm 3 void RecolorInsert(G, c 

1: flag ^ 0; 

2: for each node ti G Vc do 
3: if color(it) = 1 then 
4: Xn ^ 0; 

5: for each node w G N{u) do 



is to identify the nodes in the induced core subgraph. Let Vc 
be the set of nodes found in the first step. Then, the second 
step of our algorithm is to determine those nodes in Vc whose 
core numbers definitely need to be updated. Finally, the third 
step of our algorithm is to update the core number of such 
nodes. 

Our main algorithm for edge insertion, called Insertion, 
is outlined in Algorithm [1] Algorithm [1] includes three sub- 
algorithms, namely Color, Recolorlnsert, and Updatelnsert, 
which corresponds the first, the second, and the third step of 
our main algorithm, respectively. In particular. Color is used to 
color the nodes in Vc with a color 1, Recolorlnsert is applied 
to recolor the nodes in Vc whose core numbers are definitely 
unchanged with a color 0, and Updatelnsert is used to update 
the core number of the nodes in Vc with a color 1 . The detailed 
description of Algorithm [T] is as follows. First, Algorithm [T] 
assigns a color for every node in G (line 2 in Algorithm [Til 
and initializes Vc by an empty set (line 3 in Algorithm [T]i. 
Second, the algorithm updates the core number of the nodes 
under three different cases, i.e., C„ > C„, C„ < C„, and 
Cu — Cy. Specifically, under the first case {Cu > C„), the 
algorithm first invokes Color(G, v, c) to find the nodes in 
Gt, (line 6 in Algorithm [T]i, because only the core number of 
the nodes in G„ may need to be updated. After this process, 
all the nodes in Gy are recorded in Vc and all of them are 
colored by 1. Then, the algorithm invokes RecolorInsert(G, 
c) to identify the nodes whose core numbers are definitely 
unchanged (line 7 in Algorithm [Til. After this step, all of such 
nodes in Vc are recolored by 0. Finally, the algorithm invokes 
UpdateInsert(G, c) to update the core number of the nodes in 
Vc with color 1 (line 8 in Algorithm [TJ. Similar process can be 
used for other two cases (line 9-13 in Algorithm [TJ. Note that 
for the case Cu = C^, we can invoke Color(G, u, c) to find the 
nodes in Guuv, because u can reach v after inserting an edge 
(u, v). Below, we describe the details of our sub-algorithms. 
Color, Recolorlnsert, and Updatelnsert, respectively. 

Recall that after inserting an edge {u, v), by the fc-core up- 
date theorem, we have three cases that need to be considered, 
i.e., Cu < Cy, Cu > Cv, and Cu = Cy To simplify our 







u. .1 vi^uiui^uLy/ — 1 / u. ^ then 

7: Xu ^ — Xu + 1; 

8: if X„ < c then 
9: color(ii) 0; 

10: flag 1; 

11: if flag = 1 then 

12: RecolorInsert(G, c); 



description, we mainly focus on describing our sub-algorithms 
under the case G„ = Cy, and similar description can be 
used for other cases. Suppose that node u and v have core 
number Cy = Cy = c. In this case, we have Vc = Kiu«- 
By Definition 13.21 finding the nodes in V^uv can be done 
by a Depth-First-Search (DFS) algorithm. Color depicted in 
Algorithm [2] is indeed such a DFS algorithm. In particular. 
Color will assign a color 1 to every node in Vc- At the 
beginning, Vc is initialized by an empty set and all the nodes 
are associated with a color 0. The algorithm recursively finds 
the nodes that are reachable from u and have core number c 
(line 6-7 in Algorithm O. When the algorithm visits such a 
node, if its color is 0, then the algorithm colors it by 1 and 
adds it into the set Vc (line 3-4 in Algorithm O. To find all 
the nodes in Vuuy, we can invoke Color(G, u, c). Recall that 
after inserting edge {u, v), the nodes that are reachable from 
V can also be found by Color(G, u, c). 

Recolorlnsert described in Algorithm [3] is used to identify 
the nodes in Vc whose core numbers are definitely unchanged. 
Specifically, Algorithm[3]recursively recolors the nodes whose 
core numbers do not change by a color 0. The recursion 
is terminated until no node needs to be recolored. In each 
recursion, the algorithm re-computes Xu for each node u in 
Vc- Here the recomputed X^ equals to the sum of the number 
of neighbors of node u whose core numbers are larger than c 
and the number of neighbors of node u with color 1 (line 4-7 
in Algorithm[3]l. For a node u, if the current Xy is smaller than 
or equal to c, then the algorithm recolors it by (line 8-10 in 
Algorithm [3]). 

The rationale of Algorithm [3] is as follows. First, Algo- 
rithm [3] assumes that the core numbers of all the nodes in Vc 
need to be updated. Then, for each node w in Vc, the algorithm 
recomputes Xy;. Initially, since all the neighbors of w whose 
core numbers equal to c are colored by 1, Xyj is indeed the 
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Algorithm 4 void UpdateInsert(G, c) 

1: for each node w £ Vc do 
2: if color(uO — 1 then 

3: C„^c + 1; 



same value as our previous definition. If X^, < c, then w at 
most c neighbors whose core numbers are larger than c after 
inserting an edge {u,v). As a result, C^, cannot be updated 
and the algorithm recolors it by 0. This recoloring process 
may affect the color of w's neighbors. The reason is because, 
before recoloring w, w may contribute to calculate X^, where 
z is a neighbor of w. Consequently, the algorithm needs to 
recursively recolor the nodes in Vc- Note that Algorithm [3] 
is recursively invoked at most \Vc\ + 1 times, because the 
algorithm at least recolors one node at a recursion in the worse 
case. The following theorem shows that after Algorithm [3] 
terminates, a node with a color 1 is a sufficient and necessary 
condition for updating its core number 

Tlieorem 3.2: Under the case of insertion of an edge {u,v), 
the core number of a node needs to be updated if and only if 
its color is 1 after Algorithm |5] terminates. 
Proof: First, we prove that if the core number of a node 
w needs to be updated, then its color is 1 after Algorithm [3] 
terminates. We focus on the case of C„ = C„ = c, similar 
proof can be used to prove the other two cases. By our 
assumption and Lemma 13.51 we have w G Vc, where Vc = 
Vuvjv Then, by Lemma [331 after inserting an edge {u,v), 
the core number of the nodes in Vc increases by at most L 
Therefore, if Cw needs to be updated, then the updated core 
number of w must be c + 1. That is to say, node w must have 
c+1 neighbors whose core numbers are larger than or equal to 
c + 1. Now assume that the color of node w is 0. This means 
that < c when Algorithm |3] terminates. Recall that X^ 
denotes to the sum of the number of neighbors whose core 
numbers are larger than c and the number of neighbors whose 
color is I. This implies that node w has at most c neighbors 
whose core numbers are larger than c, which is a contradiction. 

Second, we prove that if a node has a color 1 after 
Algorithm |3] terminates, then the core number of this node 
must be updated. We consider the induced subgraph by the 
nodes with color 1 after Algorithmic] terminates and the nodes 
whose core numbers are greater than c. Consider a node w 
in such an induced subgraph. Clearly, if w has a color 1, 
then it has X^ > c neighbors. And if w has a color 0, then 
its core number Cu, is larger than c. By Definition 12.11 the 
induced subgraph belongs to the (c + l)-core. Therefore, the 
core number of a node w with color 1 is at least c+1. By 
Lemma [331 after inserting an edge {u, v), the core number of 
any nodes in graph G increases by at most 1. Consequently, 
the core number of the nodes with color 1 increases by 1 . This 
completes the proof. □ 

Updatelnsert outlined in Algorithm |4] increases the core 
numbers of the nodes in Vc with label 1 to c+1, because only 
the core numbers of those nodes need to increase by 1 after 
the coloring and recoloring processes. The correctness of our 
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Fig. 3. A graph after inserting an edge (i)g, uio). 

algorithm for edge insertion can be guaranteed by Theorem l3.1l 
and Theorem 13.21 The following example explains how the 
Insertion algorithm works. 

Example 3.3: Let us consider the same graph given in Fig. [T] 
Assume that we insert an edge {vs,via), which results in a 
graph given in Fig. |3] In Fig. |3] the dashed line denotes the 
inserted edge. Since C^g = Cv-^„ = c = 2, the Insertion 
algorithm first invokes Color(G, vs, 2). After this process, 
we can get that Vc {ws, Vio, '^9, '"2, '^1, t'ls, ^ii, wi2, "is}- 
And all the nodes in Vc are colored by 1 and the other 
nodes are colored by 0. Then, the algorithm invokes the 
RecolorInsert(G, 2) algorithm. To simplify our description, 
we assume that the node visiting-order in Vc is their DFS 
visiting-order At the first recursion, we can find that X„j ~ 2, 
thereby it is recolored by 0. Also, the node V12 is recolored 
by 0, because X^-^^ ~ 2. At the second recursion, we can find 
that the nodes vu and W13 are recolored by 0. At the third re- 
cursion, no node needs to be recolored, the algorithm therefore 
terminates. After invoking the RecolorInsert(G, 2) algorithm, 
the nodes {ug, vio, vq,V2, wis} are colored by 1, thereby their 
core numbers must increase to 3 by Theorem l3.2l Finally, the 
Insertion algorithm invokes the UpdateInsert(G, 2) algorithm 
to update the core number of such nodes. As a consequence, 
the core number of the nodes {vs, wio, vg, V2, wis} is increased 
to 3. □ 

We analyze the time complexity of the Insertion algorithm 
as follows. First, the Color algorithm takes 0{J2uev 
time complexity. Second, the Recolorlnsert algorithm takes 
'^(l^cl J2uev ^u) time complexity in the worse case, as the 
algorithm is recursively invoked at most 0(|Vc|) times and 
each recursion takes 0(X)tiev ^«) complexity. Finally, 
the Updatelnsert algorithm takes 0(1141) time complexity. 
Put it all together, the time complexity of the Insertion 
algorithm is 0(1^1 Yluev ™ '■^^ worse case, which is 
independent of the graph size. However, in practice, the algo- 
rithm is more efficient than such worse-case time complexity. 
The reason could be of twofold. On the one hand, | Vc \ typically 
not very large w.rt. the number of nodes of the graph. On the 
other hand, very often, the Recolorlnsert algorithm terminates 
very fast. 

Algorithm for edge deletion: The main algorithm for edge 
deletion, namely Deletion, is outlined in Algorithm |5] Similar 
to the edge insertion case. Deletion also includes three sub- 
algorithms: Color, RecolorDelete, and UpdateDelete. Here 
Color is used to find the nodes in the induced core subgraph, 
RecolorDelete is utilized to identify the nodes whose core 
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Algorithm 5 Deletion(G, u, v) 

Input: Graph G = {V, E) and an edge [u, v) 
Output: the updated core number of the nodes 
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numbers need to be updated, and UpdateDelete is applied 
to update the core numbers of the nodes identified by Re- 
colorDelete. The detailed description of Deletion is given as 
follows. 

Similarly, let Vc be a set of nodes whose core numbers may 
need to be updated. First, Algorithm |5] initializes the color of 
all the nodes to and Vc to an empty set. Likewise, under 
the edge deletion case, we also have to consider three cases. 
That is, Cu > Cy, Cu < Gu, and C„ = C„. If C„ > C„, only 
the core number of the nodes in Gy may need to be updated. 
Under this case, the algorithm invokes Color(G, v, c) to find 
the nodes in Vc (line 6 in Algorithm |5]l. Then, the algorithm 
invokes RecolorDelete(G, c) to identify the nodes whose core 
numbers need to be changed (line 7 in Algorithm |5]l. Finally, 
the algorithm invokes UpdateDelete(G, c) to update the core 
number of such nodes (line 8 in Algorithm |5]i. Similar process 
can be used for the G„ < Gy case. For the Gy = Gy case, the 
algorithm first invokes Color(G, u, c) to find the nodes in G„ 
(line 16 in Algorithm |5]l. Then, the algorithm has to handle 
two different cases. First, if u can reach v, then the coloring 
algorithm can also find the nodes in Gy (in this case, u's color 
is 1, line 22 in Algorithm |5]l. Second, if u cannot reach v (v's 
color is 0), then the algorithm invokes Color(G, v, c) to find 
the nodes in Gy (line 19 in Algorithm |5]l. After this process, 
all the node in G„ui' are recorded in Vc- Then, we can invoke 
RecolorDelete(G, c) and UpdateDelete(G, c) algorithms to 
update the core number of the nodes in Vc- Below, we give 
the detailed descriptions of RecolorDelete and UpdateDelete 
respectively. 

Similar to the edge insertion case, after invoking Color, 



Algorithm 6 void RecolorDelete(G, c) 

1: flag ^ 0; 

2: for each node u £ Vc do 
3: if color(u) = 1 then 
4: Xu 0; 

5: for each node w G N(u) do 

6: if (color(M) = 1) or (Cw > c) then 

7: Xu ^ — Xu + 1; 

8: if Xu < c then 

9: color(M) -h- 0; 

10: flag ^ 1; 

11: if flag = 1 then 
12: RecolorDelete(G, c); 



Algorithm 7 void UpdateDelete(G, c) 

1: for each node lu G Vc do 
2: if color(uO — then 

3: Cu, c - 1; 



the nodes whose core numbers may need to be updated 
are recorded in a set Vc, and also all of them are colored 
by 1. After obtaining the set Vc, RecolorDelete described 
in Algorithm |6] is used to determine the nodes whose core 
numbers must be updated. In particular, RecolorDelete re- 
cursively recolors the nodes whose core numbers need to be 
updated by 0. In each recursion, the algorithm calculates Xyj 
for every node w in Vc- Here Xyj denotes the sum of the 
number of w's neighbors whose color is 1 and the number of 
w's neighbors whose core numbers are larger than c, where 
c = mm{Gu, Gy}- For a node w G Vc, if X^, < c, then the 
algorithm colors w by 0. The algorithm terminates if no node 
needs to be recolored. Clearly, the algorithm is invoked at 
most \Vc\ times. The following theorem shows that a node in 
Vc with a color after Algorithm |6] terminates is a sufficient 
and necessary condition for updating its core number 

Theorem 3.3: Under the case of deletion of an edge {u,v), a 
node in Vc whose core number needs to update if and only if 
its color is after Algorithm |6| terminates. 
Proof: First, we prove that if a node w in Vc whose 
core number needs to be updated, then its color is after 
Algorithm |6] terminates. By our assumption and Lemma [3731 
after deleting an edge {u,v), Gy, decreases by L This means 
that Gy, decreases to c — L That is to say, w has c — 1 neighbors 
whose core numbers are larger than or equal to c — 1. Suppose 
that the color of is 1 after the algorithm terminates. This 
implies that Xy, > c. Recall that Xy, denotes the sum of the 
number of w's neighbors whose core numbers are lager than c 
and the number of w's neighbors whose color is L Note that 
a node with color 1 suggests that its core number equals to c. 
As a result, w has at least c neighbors whose core numbers 
are larger than or equal to c, which is a contradiction. 

Second, we prove that if a node w in Vc is recolored by 
after Algorithmic] terminates, then Gy, must be updated. After 
deleting an edge {u, v), we construct an induced subgraph, 
which is denoted as G = (V, E), by the nodes in Vc and the 
nodes whose core numbers are larger than c. Note that the core 
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number of the nodes in V\V is smaller than c. Therefore, they 
do not affect the core number of the nodes in V. If a node 
w € Vc with a color after Algorithm |6] terminates, then 
< c. This suggests that the node w in G has at most 
c — 1 neighbors. By Definition 12.11 G at most belongs to the 
(c — l)-core. By Lemma [331 the core number of any nodes 
in G decreases by at most 1 after deleting an edge. Therefore, 
the core number of the nodes with color decreases by 1. 
This completes the proof. □ 
UpdateDelete which is depicted in Algorithm |7] is used to 
update the core number of the nodes in Vc with color to c — 1, 
because only the core numbers of those nodes need to decrease 
by 1 after the coloring and recoloring steps. The correctness of 
Deletion can be guaranteed by Theorem [TT] and Theorem l3.3l 
By a similar analysis as the edge insertion case, the time 
complexity of Deletion(G, u, v) is 0(|Vc|X]uey -^")- 
following example explains how Deletion works. 

Example 3.4: Let's consider the graph depicted in Fig. |3] 
Suppose that we delete the edge (wg, uio)- Since C^g — C^io ~ 
c = 3, the Deletion algorithm first invokes Color(G, us, 3), 
which results in Vc ~ {vs}- Clearly, the color of wio is after 
this process ends. Hence, the algorithm invokes Color(G, uio, 
3), which leads to Vc = {vs,vio,vq,V2,vis}. After this pro- 
cess, all the nodes in Vc are colored by 1 and other nodes are 
colored by 0. Then, the algorithm invokes RecolorDelete(G, 
3). At the first recursion, since X^^ = 2, vs is recolored 
by 0. Similarly, vio,vg,V2, and uig will be recolored by 
at the first recursion. At the second recursion, the algorithm 
terminates because no node needs to be recolored. Therefore, 
all the nodes in Vc are recolored by 0. Finally, the algorithm 
invokes UpdateDelete(G, c) to decrease the core number of 
all the nodes in 14 to 2. □ 

B. Pruning strategies 

As analysis in the previous subsection, the time complexity 
of our Insertion and Deletion algorithms depend on the size 
of Vc- In this subsection, to further accelerate our algorithms, 
we devise two pruning techniques, namely X-pruning and Y- 
pruning, to remove the nodes in Vc whose core numbers are 
definitely unchanged given the graph is updated. 

X-pruning: By Lemma 13.11 for a node w, is an upper 
bound of Cw Here we make use of such upper bound to 
develop pruning technique. We refer to it as X-pruning. Below, 
we discuss the X-pruning technique over the edge insertion 
and edge deletion cases, respectively. 

First, we consider the insertion case. Assume that we insert 
an edge (uq, vq). Also, we need to consider three cases, Cuo > 
Cvo^ Cuo < ^UQ, Gua = GijQ. Below, we mainly focus on 
describing the X-pruning rule under the case of Cug = G-ug, 
and similar descriptions can be used for other two cases. For a 
node w in Vc, after inserting an edge {uq, vq), if equals to 
c, then Gw cannot increases to c+ 1. As a result, we can safely 
prune w. For example, consider an graph in Fig. [3] Assume 
that we insert an edge {vg,,vio). Then, for the node vi, we 




Fig. 4. A toy induced core subgraph. 



have = 2. Clearly, G„j cannot increase to 3, thereby we 
can prune vi. 

In effect, after removing w, for the nodes that cannot be 
reachable from uq and vq in the induced core subgraph can 
also be pruned. Let us consider a toy induced subgraph shown 
in Fig.|4] Suppose that the induced subgraph can be partitioned 
into three parts. Si, w, and 5*2. Further, we assume that both 
Uq and Vq are in Si, and X^j = c. Recall that after inserting 
an edge {uo,vo), if X^, = c, then Gw is unchanged. By 
Lemma [331 w will not affect the core numbers of the nodes 
in 5*2. As a consequence, the core numbers of the nodes in S2 
cannot be increased, and we can safely prune all the nodes in 
S2- More formally, we give a pruning theorem as follows. 

Theorem 3.4: Given a graph G and an edge {uq,vo). After 
inserting an edge (uq, vq) in G, for a node w £ Vc and X^y < 
c + 1, we have the following pruning rules. 

• If > Gvg (i.e., Vc = Vyg), then for any node u £ Vc 
that every path from wq to u in Gy„ must go through w 
can be pruned. 

• If Gua < (i-S-' Vc = Vua)' then for any node u E Vc 
that every path from uq to u in G„„ rnust go through w 
can be pruned. 

• If GuQ — GvQ (i.e., Vc — KioUi'oA f'^^" for any node 
u £ Vc that every path either from uq to u or from vq to 
u in GuqUvo must go through w can be pruned. 

Proof: We prove this theorem under the case G„„ = G^^, 
and similar arguments can be used to prove the other two 
cases. After inserting an edge {uo,vq), by Lemma [331 the 
core number of every node in Vc increases by at most L As a 
result, after an edge (uq, uq) insertion, for a node w in Vc, if 
Xu, < c + 1, then Gw will not increase. G^ does not change 
implying that w is still in the c-core after inserting an edge 
(wqjWo)- Clearly, it does not affect those nodes in Vc whose 
core numbers will increase to c + L Therefore, we can safely 
remove the node w from Vc- After removing w, for any node 
u € Vc\{w} that cannot be reached from uq or uq, we also 
can safely remove it from Vc- The reason is because only the 
core number of the nodes that are reachable from uq or uo may 
need to be updated. As a consequence, for any node u e Vc 
such that every path either from uq to u or from to u must 
go through w can be pruned. This completes the proof. □ 
Based on Theorem 13.41 we can prune certain nodes in the 
coloring procedure (the Color algorithm). We present our new 
coloring algorithm with X-pruning in Algorithm [Sj The new 
coloring algorithm is still a DFS algorithm. The algorithm first 
calculates Xu when it visits a node u (line 2-5 in Algorithm[8]). 
Based on Theorem l3.4l the DFS algorithm can early terminate 
if it visits a node u such that Xu < c. The reason is that we 
can safely remove such a node u from V by Theorem 13.41 
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Algorithm 8 void XPruneColor(G, u, c) 

1: visited(ii) 1; 
2: ^ 0; 

3: for each node w G N{u) do 

4: if Cw > c then 

5: A'„ ^ A'„ + 1; 

6: if Xu > c then 

7: if color(M) = then 

8: 14 ^ 14 U {u}; 

9: color(ii) = 1; 

10: for each node w G N{u) do 
11: if visited(TO) — and Cw ~ c then 
12: XPruneColor(G, c); 



Hence, the algorithm does not need to recursively visits its 
neighbors. If Xu > c, the algorithm adds node u into 14 
and color it by 1 (line 7-9 in Algorithm [8]l. And then, the 
algorithm recursively finds us neighbors in 14 (line 10-12 
in Algorithm |8]l. To implement this pruning strategy, we can 
replace the Color algorithm with the XPruneColor algorithm 
in Algorithm [T] 

Second, we consider the edge deletion case. Suppose that 
we delete an edge {uq, vq) from graph G and the core numbers 
of all the nodes in 14 are c. We consider three different cases: 
(1) Cu„ > a,„, (2) Cu„ < Cy„, and (3) C„„ = a„. For 
> Cvg, we only need to find the nodes in Gv„, because 
the deletion of edge (uo, va) does not affect the core number 
of the nodes in G„„ . Recall that after deleting an edge, the core 
number of the nodes in 14 decreases by at most 1 . Therefore, 
after deleting an edge {uq,vo), if X-^g > c, then vq's core 
number will not be changed. This is because X^^ > c implies 
Vq has at least c neighbors whose core numbers are larger than 
or equal to c. That is to say, the core number of node vq is 
still c. Since vq's core number does not change, we do not 
need to update the core number of the nodes in Gy„. As a 
result, under the case of C„ > in Algorithm |5] (line 4 in 
Algorithm |5]i, we can first compute Xy. If X^ > c, we do 
nothing. Symmetrically, for C„„ < Cv„, we have a similar 
pruning rule as the case of Gu^ > Gy„. Also, for Gu,, = 
Gi,g, we first compute Xu^ and Xy^. If Xua < c, then we 
need to update the core number of the nodes in Gu,,. Also, 
if Xyg < c, we update the core number of the nodes in Gy^, . 
For the case that Xug > c and Xy^ > c, we do nothing, 
because no node's core number needs to be updated. It is 
worth mentioning that X^o and Xy^ are computed based on 
the core numbers of the nodes that have not been updated. The 
detailed algorithm with X-pruning for the edge deletion case 
is outlined in Algorithm |9] We can use the XPruneDeletion 
algorithm to replace the Deletion algorithm. The following 
example illustrates how this algorithm works. 

Example 3.5: Let us reconsider the example given in Fig. |3] 
Assume that we delete the dashed line (edge (wg, wio)). In this 
case, the core number of vs and vio is 3. That is, c = 3. Then, 
we can calculate that Xy^ = 2 and ^i.^, = 2. Because vs has 
two neighbors {v^ and vq) whose core number is 4 and vio has 
two neighbors (vg and vis) whose core numbers are 3. Since 



Algorithm 9 XPruneDeletion(G, u, v) 

Input: Graph G = {V, E) and an edge (u, v) 
Output: the updated core number of the nodes 

1: Initialize visited(ui) *r- for all node w £V; 

2: Initialize colorC™) -s— for all node w £V; 

3: 14 ^ 0; 

4: Compute Xu\ 

5: Compute X^,; 

6: if C„ > then 

7: c^C„; 



8: if X„ < c then 

9: Color(G, V, c); 

10: RecoIorDeleteCC c); 

11: UpdateDelete(G, c); 

12: if Cu < Cv then 

13: c^Cu, 

14: if X„ < c then 

15: Color(G, u, c); 

16: RecolorDelete(G, c); 

17: UpdateDelete(G, c); 

18: if Cu = Cv then 

19: c ^ Cu, 

20: if Xu < c and X„ < c then 

21: Color(G, u, c); 

22: if color(t;) = then 

23: Initialize visited(TO) for all node w G V; 

24: Color(G, V, c); 

25: RecolorDelete(G, c); 

26: UpdateDelete(G, c); 

27: else 

28: RecolorDelete(G, c); 

29: UpdateDelete(G, c); 

30: if X„ < c and Xu > c then 

31: Color(G, u, c); 

32: RecoIorDelete(G, c); 

33: UpdateDelete(G, c); 

34: if Xu > c and X„ < c then 

35: CoIor(G, V, c); 

36: RecolorDeIete(G, c); 

37: UpdateDelete(G, c); 



Xyg < c and Xy-^„ < c, we need to update the core number of 
the nodes in Gy^ and Gy-^„. After invoking Algorithmic] we 
can find that the core number of nodes {wg, wio, ug, W2, 1'ls} 
decreases to 2. n 

y-pruning: For a node w, Yyy is a lower bound of Gy, by 
Lemma 13.11 Here we develop pruning technique using such 
lower bound, and we refer to this pruning technique as Y- 
pruning. 

To illustrate our idea, let us reconsider the toy induced core 
subgraph shown in Fig. |4] which includes three parts, ^i, w, 
and 5*2. Suppose that we insert or delete an edge (uo,wo). 
Below, we focus on the case of Guq = Cyg — c, and similar 
descriptions can be used for other two cases. Further, we 
assume that both uq and vq are in 5*1, and Yw = c. First, 
we consider the insertion case, i.e., an edge {uq, vq) insertion. 
In this case, we claim that the core number of the nodes in 
52 are unchanged. The reason is as follow. Let u in 52 be a 
neighbor node of w. Then, for any neighbor u, we have Yy < c 
(if not, u and w will be in a (c + l)-core). This implies that 



9 



for each neighbor of w in S2, the core number cannot increase 
to c + 1 after inserting (uo,wo)- As a result, the core numbers 
of all the nodes in ^2 will not change after inserting (wq, vq)- 
Second, for the deletion case, if we delete an edge (wo,wo), 
Cw still equals to c because w has c neighbors whose core 
numbers are larger than c {Yu, = c). Clearly, the core numbers 
of the nodes in S2 are also unchanged. Put it all together, 
under both edge insertion and edge deletion cases, the core 
numbers of all the nodes in 5*2 will not change, and thereby 
we can safely prune the nodes in S2- Formally, for F-pruning, 
we have the following theorem. 

Theorem 3.5: Given a graph G and an edge [uq^vq). After 
inserting/deleting an edge (mqjWo) in G, for a node w G Vc, 
if Yyj ~ c, then we have the following pruning rules. 

• If > Gyg (i.e., Vc = Vyg), then for any node u G Vc 
and u ^ w that every path from vq to u must go through 
w can be pruned. 

• If < C^jQ (i.e., Vc = Vuo), then for any node u ^ Vc 
and u ^ w that every path from uq to u must go through 
w can be pruned. 

• If Guo = Gy„ (i.e., Vc = Vuouvo)' then for any node 
u G Vc and u ^ w that every path either from uq to u 
or from vq to u must go through w can be pruned. 

Proof: We prove this theorem under the case Guq = G^^, and 
for other cases, we have similar proofs. Below, we discuss 
the proofs for the edge insertion and edge deletion cases, 
respectively. 

First, we prove the edge insertion case. Let Vyc be a set 
of nodes whose core numbers are larger than c. Assume that 
we remove w from Vc- Then, after removing w, we denote 
a set of nodes in Vc that cannot be reachable either from uo 
or from as Vi. Then, after inserting an edge (mqjWo). we 
consider two cases: (1) tu's core number will not change, and 
(2) w's core number increases by 1. The first case suggests 
that w is still in the c-core, and we can safely remove w from 
Vc- Therefore, for the nodes in Vi, we can also remove them 
from Vc, because only the core number of those nodes that are 
reachable from mo or vq may need to be updated. Second, we 
consider the case that w's core number increases by 1 after 
inserting an edge [u^^vq). We denote a subset of nodes in 
Vc whose core numbers increase by 1 as Vc after inserting 
an edge (ito,wo)- Further, we denote a subset of nodes in Vi 
whose core numbers need to increase by 1 as V2. In other 
words, V2 = n Vc. Clearly, the theorem holds if V2 = 0. 
Now we prove this by contradiction. Specifically, we assume 
that V2 7^ 0. By definition, after inserting an edge (ug, vq), the 
induced subgraph by the nodes in 14 IJ ^>c forms a (fc + 1)- 
core. We denote such subgraph as G" = [V , E'), where V' ~ 
Vc U Vyc- Clearly, all the nodes in G' has at least a degree 
c + 1. Now consider a subgraph G* induced by the nodes 
in V2 U{''^} U ^>c- We claim that all the nodes in G* has at 
least a degree c + 1. First, for the nodes in Vyc, their degree 
is obviously greater than c + 1 w.r.t. G*. Second, we consider 
the nodes in V2. By definition, in graph G', there is no edge 
between the nodes in V2 and the nodes in V"c\{V2 IJI'^}}- 



Algorithm 10 void YPruneColor(G, u, c) 

1: visited(it) <— 1; 
2: if color(u) = tiien 
3: Vc ^ K U {u}; 
4: color(u) = 1; 
5: y„ ^ 0; 

6: for each node w G A'^(ii) do 
7: it Cw > c tlien 

8: y„ ^ y; + 1; 

9: if y„ < c tiien 

10: for each node w £ N{u) do 

11: if visited(ti;) = and Cw ~ c then 

12: YPruneColor(G, w, c); 



Since the nodes in V2 have at least a degree c + 1 w.rt. graph 
G', they also have at least a degree c+1 w.rt. graph G*. Third, 
we consider the node w. On the one hand, we claim that w 
has at least one neighbor in V2. Suppose w has no neighbor in 
V2, then the nodes in V2 whose core numbers cannot increase 
to c + 1 after inserting an edge (uq, wq) by the fc-core update 
theorem, which contradict to our assumption. Hence, w has at 
least one neighbor in V2. On the other hand, since Y^, = c, 
w has c neighbors whose core numbers are larger than c. As 
a result, w has at least a degree c+1 w.rt. graph G*. Put it 
all together, all the nodes in G* have at least a degree c + 1. 
Note that by our definition the induced subgraph G* does not 
contain node uq and vq. Consequently, before inserting the 
edge (mojWo). the core number of the nodes in G* at least 
c+1. That is to say, the nodes in V2 has core number c + 1 
before inserting the edge (uo,uo), which is a contradiction. 
This completes the proof for the edge insertion case. 

For the edge deletion case, after deleing an edge (uo,wo), 
the core number of all the nodes in Vc decreases by at most 
1 according to Lemma 13.31 Hence, if a node w G Vc has 
Yw — c, then w's core number will not decrease. Similarly, let 
V>c be a set of nodes whose core numbers are larger than c. 
And assume that we remove w from Vc. Then, after removing 
w, we denote a set of nodes in Vc that cannot be reachable 
either from uq or from uo as Vi. Now consider a subgraph 
G* induced by the nodes Vi IJ{^}U^>c- We claim that all 
the nodes in such subgraph have at least a degree c. First, for 
the nodes in Vyc, their degree is clearly larger than c w.rt. 
G* because their core numbers are larger than c. Second, tu's 
degree is at least c w.r.t. G*, because w has c neighbors whose 
core numbers are larger than c. Third, for the nodes in Vi, 
their degree is also at least c w.r.t. G* . The rationale is as 
follows. By definition, no edge in G goes through the nodes 
in V"c\{Vi U{w}} and the nodes in Vi. Since the core number 
of the nodes in Vi is c, the nodes in Vi has at least c neighbors 
w.r.t. G*. Consequently, the core number of the nodes in G* 
is still c after removing the edge (mo,wo). This implies that 
the nodes in Vi can be pruned, which completes the proof for 
the edge deletion case. □ 

Based on Theorem 13.51 we can implement the F-pruning 
strategy in the coloring procedure. We present our new color- 
ing algorithm with F-pruning in Algorithm (TO] which is also 
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a DFS algorithm. In particular. Algorithm [TO] first colors a 
node u by 1 and adds it into Vc when it visits u (line 2-4 
in Algorithm [TOb. Then, the algorithm calculates Yu (line 5-8 
in Algorithm [TOl i. If = c, then the algorithm can early 
terminate. The reason is because the nodes that cannot be 
reachable from uq or vq after removing u can be pruned 
by Theorem 13.51 If Y^ < c, the algorithm recursively finds 
u's neighbors in Vc (line 9-12 in Algorithm [TOli. Below, we 
discuss how to integrate the YPruneColor algorithm into the 
Insertion and Deletion algorithm. 

First, to integrate the YPruneColor algorithm into the 
Insertion algorithm, we need to replace the Color algorithm 
with the YPruneColor algorithm as well as handle the fol- 
lowing special case. That is, if Cu„ — C„„ = c, Y^g = c 
and Yyg < c, we need to invoke YPruneColor(G, vo, c). If 
Cuo = C'l'o — Yug < c and Y^^ ~ c, we need to invoke 
YPruneColor(G, uq, c). The reason is because we need to 
allow the DFS algorithm to go through the edge (wo,wo) in 
order to add both uq and vq into Vc- If Cuq — Cv„ — c and 
Yuo ~ Yv„ = c, then we have to invoke both YPruneColor(G, 
Uq, c) and YPruneColor(G, vq, c) so as to add both uq and vo 
into Vc- Second, to integrate the YPruneColor algorithm into 
the Deletion algorithm, we only need to replace the Color 
algorithm with the YPruneColor algorithm. The following 
example illustrates how the YPruneColor algorithm works. 

Example 3.6: Consider an example in Fig. [3] For the edge 
insertion case, we assume that the edge (wg, viq) is the inserted 
edge. Since vs = viq = c = 2 and Y^^ = 2, we invoke 
YPruneColor(G, vio, 2). The algorithm first colors viq by 1 
and adds it into Vc- Then, the algorithm colors node vg by 1 
and adds it into Vc- Since F„g = 2, the recursion terminates at 
wg and returns to vio- Similarly, when the algorithm visits 
node V2, the recursion also terminates as = 2. As a 
result, the node vi is pruned. Finally, we can obtain Vc ~ 
{wio,U8,W9,V3,wi8,wii,ui2,wi3} after the algorithm ends. 

For the edge deletion case, we also assume that we delete an 
edge (wg, wio) from G. Under this case, we have vg ~ viq ~ 
c = 3. Since no node in Vc has Y^ = c, the F-pruning cannot 
prune any node. Suppose that the edge (u8,wio) is deleted. 
Then, we have wg = I'lo = c = 2. Under this case, assume that 
we further delete an edge {vg, wio). Then, we can find that the 
set Vc contains nodes {vg, W2, wi, wio, vis, wn, wi2, wis}- Since 
= c = 2, the node vi can be pruned by the YPruneColor 
algorithm. □ 

Combination of X-pruning and F-pruning: Here we dis- 
cuss how to combine both X-pruning and F-pruning for 
edge insertion case and edge deletion case, respectively. For 
edge insertion case, we can integrate both X -pruning and 
y-pruning into the coloring procedure. Specifically, in the 
coloring procedure, when the DFS algorithm visits a node u, 
we calculate both Xu and y„. Then, we use the X -pruning 
rule to determine the color of node u, and make use of 
both X-pruning and F-pruning rules to determine whether 
the algorithm needs to recursively visits u's neighbors or not. 
For edge insertion, the detailed coloring algorithm with both 



Algorithm 11 void XYPruneColor(G, u, uq, c) 

1: visited(ii) 1; 
2: Xu ^ 0; 
3: Yu ^ 0; 

4: for each node w G 7V(ii) do 

5: if Cm > c then 

6: Xu <— Xu + 1; 

7: it u uo and Cw > c then 

8: K K + 1; 

9: if > c then 
10; if F„ < c or c = then 
11: for each node w £ N{u) do 
12: if visited(TO) — and Cw — c then 

13: XYPruneColor(G, w, uo, c); 

14: if color(it) = then 
15: Vc ^VcU {u}- 

16: color(u) = 1; 



X-pruning and F-pruning, called XYPruneColor, is outlined 
in Algorithm |8] 

For the edge deletion case, we can easily integrate both 
X-pruning and y-pruning via the following two steps. First, 
we replace the Color algorithm in Deletion with the YPrune- 
Color algorithm. Second, we integrate the X-pruning rule into 
the Deletion algorithm. First, we replace the Color algorithm 
in XPrunDeletion with the YPruneColor algorithm. Second, 
we use this XPrunDeletion algorithm to replace the Deletion 
algorithm. 

IV. Experiments 

In this section, we conduct comprehensive experiments to 
evaluate our approach. In the following, we first describe our 
experimental setup and then report our results. 

A. Experimental setup 

Different algorithms: We compare 5 algorithms. The first 
algorithm is the baseline algorithm, which invokes the 0(n + 
m) algorithm to update the core number of nodes given the 
graph is updated [20]. We denote this algorithm as algorithm 

B. The second algorithm is our basic algorithm without 
pruning strategies, which is denoted as algorithm N. The third 
algorithm is our basic algorithm with X-pruning, which is 
denoted as algorithm X. The fourth algorithm is our basic 
algorithm with F-pruning, which is denoted as algorithm Y. 
The last algorithm is our basic algorithm with both X-pruning 
and F-pruning, which is denoted as algorithm XY. 

Datasets: We collect 15 real-world datasets to con- 
duct our experiments. Our datasets are described as fol- 
lows. (1) Co-authorship networks: we download four 
physics co-authorship networks from Stanford network 
data collections [19] which are HepTh, HepPh, As- 
troph, and CondMat datasets. In addition, we also ex- 
tract a co-authorship network from a subset of the DBLP 
dataset (www . inf ormatik . uni-trier . de/ ~ley/ db^ 
with 78,649 authors. (2) Online social networks: we col- 
lect the Douban Owww . douban . comb dataset from ASU 
social computing data repository [24], and collect the 
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TABLE I 
Summary of the datasets 



Name 


#nodes 


#edges 


Ref. 


Description 


HepTh 


9,877 


51,946 


[19] 




HepPh 


12,008 


236,978 


[19] 


Co-authorship 


Astroph 


18,772 


396,100 


[19] 


networks 


CondMat 


23,133 


186,878 


[19] 




DBLP 


78,649 


382,294 


website 




Douban 


154,908 


654,324 


[24] 




Epinions 


75,872 


396,026 


[19] 


Online 


Slashdotl 


77,360 


826,544 


[19] 


social 


blasndotz 


82,168 


867,372 


[19] 


networks 


Wikivote 


5,311 


142,066 


[19] 




EmailEnron 


36,692 


367,662 


[19] 


Communication 


EmailEuAll 


265,182 


224,372 


[19] 


networks 


Gnutella 


62,586 


153,900 


[19] 


P2P networks 


Brightkite 


58,228 


428,156 


[19] 


Location based 


Gowalla 


196,591 


1,900,654 


[19] 


social networks 



Epinions ( |www . epinio ns . com), two Slashdot datasets 
(www . slashdot . orgi), and the Wikivote dataset from 
Stanford network data collections [19]. (3) Communication 
networks: we employ two Email communication networks, 
namely EmailEnron and EmailEuAll, from Stanford network 
data collections [19]. (4) P2P networks; we download a P2P 
network (Gnutella) dataset from Stanford network data collec- 
tions [19], which are originally collected from Gnutella [19]. 
(5) Location-based social networks (LBSNs): We download 
two notable LBSNs datasets from Stanford network data 
collections [19]. For all the datasets, if the graph is a directed 
graph, we ignore the direction of the edges in the graph. The 
detailed statistical information of our datasets are described in 
Table U 

Experimental environment: We conduct our experiments 
on a Windows Server 2007 with 4xDual-Core Intel Xeon 
2.66 GHz CPU, and 128G memory. All the algorithms are 
implemented by Visual C++ 6.0. 

B. Results for single edge updates 

For all the experiments, we randomly delete and insert 500 
edges in the original datasets. After inserting/deleting an edge, 
we invoke 5 different algorithms to update the core number of 
the nodes, respectively. For all the algorithms, we record the 
average time to update the core number of nodes over 500 edge 
insertions and 500 edge deletions. Specifically, we record three 
quantities, namely average insertion time, average deletion 
time, and average update time. We calculate the average inser- 
tion (deletion) time by the average core number update time of 
different algorithms over 500 edge insertions (deletions). The 
average update time is the mean of average insertion time 
and average deletion time. To evaluate the efficiency of our 
algorithms (algorithm N, algorithm X, algorithm Y, algorithm 
XY), we compare them with the baseline algorithm (algorithm 
B) according to the average insertion/deletion/update time. Our 
results are depicted in Table HI] 

From Table [III we can clearly see that all of our algorithms 
(algorithm N, algorithm X, algorithm Y, algorithm XY) per- 
form much better than the baseline algorithm (algorithm B) 
over all the datasets used. The best algorithm is the algorithm 



XY, which is our basic algorithm with both X-pruning and 
y-pruning, followed by algorithm X, algorithm Y, algorithm 
N, and algorithm B. Over all the datasets used, the maximal 
speedup of our algorithms is achieved in Gowalla dataset 
(the last row in Table Hill. Specifically, in Gowalla dataset, 
algorithm XY, algorithm X, algorithm Y and algorithm N 
reduce the average update time of algorithm B by 101.8, 
81.7, 62.3, and 56.2 times, respectively. The minimal speedup 
of our algorithms is achieved in HepTh dataset (the first 
row in Table Hill. In particular, in HepTh dataset, algorithm 
XY, algorithm X, algorithm Y and algorithm N reduce the 
average update time of algorithm B by 3.2, 3.0, 2.3, and 2.2 
times respectively. In general, we find that the speedup of our 
algorithms increases as the graph size increases. The reason is 
because the time complexity of the baseline algorithm is linear 
w.r.t. the graph size for handling each edge insertion/deletion. 
Instead, the time complexity of our algorithms is independent 
of the graph size, and it is only depends on the size of the 
induced core subgraph. Additionally, over all the datasets, we 
can observe that our basic algorithm with pruning techniques 
is significantly more efficient than the basic algorithm without 
pruning techniques. Below, we discuss the effect of the X- 
pruning and F-pruning techniques. 

The effect of pruning: Here we investigate the effective of 
our pruning techniques. From Table [III over all the datasets, 
we can see that the X-pruning strategy (algorithm X) is more 
effective than the F-pruning strategy (algorithm Y) according 
to average deletion/insertion/update time. For example, in 
HepTh dataset (row 1 in Table lUli, algorithm X reduces the 
average deletion time, the average insertion time, and the 
average update time, over algorithm N by 96.3%, 10%, and 
36.8%, respectively. However, in HepTh dataset, algorithm Y 
reduces the average deletion time, the average insertion time, 
and the average update time, over algorithm N by 6%, 3.1%, 
and 4.3%, respectively. This result indicates that the condition 
of the y-pruning is stronger than the condition of the X- 
pruning in many real graphs. Recall that by Theorem 13.51 if 
there is at least one node u with core number C„ and Yu = C„ 
in the induced core subgraph, then the F-pruning strategy 
may prune some nodes. The condition of F-pruning strategy 
(Yu = Cu) is strong, because if a node has C„ neighbors 
whose core number is larger than Cu, then this node may have 
another additional neighbor whose core number is larger than 
Cu, thus resulting in that the node m is in a (C„ + l)-core. 
Instead, indicating by our experimental result, the condition 
of the X-pruning strategy (X„ < C„ + 1) may be easily 
satisfied in real graphs. This result also implies that the lower 
bound of the core number in Lemma 13.11 (Y^) is typically 
very loose for many nodes in real graphs. In addition, we 
can observe that the algorithm with both X-pruning and Y- 
pruning strategies is more efficient than the algorithm with 
only one pruning strategy over all the datasets. Generally, we 
find that the X-pruning strategy under the edge deletion case 
is more effective than itself under the edge insertion case. 
Similarly, the F-pruning strategy under the edge deletion case 
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TABLE II 



Average update time of different algorithms (In last column, SR denotes the speedup ratio of XY). All time is millisecond. 



Time (ms) 


Average 


deletion time 




Average insertion time 


Average update time 


B 


N 


X 


Y 


XY 


B 


N 


X 


Y 


XY 


B 


N 


X 


Y 


XY 


SR 


HepTh 


2.38 


1.06 


0.54 


1.00 


0.48 


2.80 


1.32 


1.20 


1.28 


1.14 


2.59 


1.19 


0.87 


1.14 


0.81 


3.2 


HepPh 


4.12 


2.58 


1.30 


1.58 


1.20 


5.30 


1.46 


1.32 


1.40 


1.20 


4.71 


2.02 


1.31 


1.49 


1.20 


3.9 


Astroph 


9.14 


1.30 


0.36 


1.12 


0.32 


9.92 


1.56 


1.40 


1.42 


1.40 


9.53 


1.43 


0.88 


1.27 


0.86 


11.1 


CondMat 


5.94 


1.52 


0.64 


1.30 


0.60 


6.24 


1.50 


1.40 


1.36 


1.32 


6.09 


1.51 


1.02 


1.33 


0.96 


6.3 


DBLP 


12.08 


1.68 


1.26 


1.48 


1.22 


12.22 


1.52 


1.42 


1.44 


1.38 


12.15 


1.60 


1.34 


1.46 


1.30 


9.3 


Douban 


21.38 


4.58 


2.14 


3.28 


1.32 


21.16 


2.62 


2.02 


2.40 


2.00 


21.27 


3.60 


2.08 


2.84 


1.66 


12.8 


Epinions 


13.00 


2.06 


0.68 


1.62 


0.64 


13.94 


2.04 


1.56 


1.80 


1.50 


13.47 


2.05 


1.12 


1.71 


1.07 


12.6 


blasndotl 


22.53 


4.12 


1.43 


2.06 


1.38 


20.37 


2.80 


1.73 


1.88 


1.32 


20.45 


3.46 


1.58 


1.87 


1.35 


15.1 


Slashdot2 


24.36 


4.85 


1.56 


2.13 


1.54 


22.32 


2.93 


1.82 


2.05 


1.64 


23.34 


3.73 


1.69 


2.09 


1.59 


14.7 


Wikivote 


3.64 


1.32 


0.50 


0.50 


0.48 


4.06 


1.78 


1.70 


1.76 


1.42 


3.85 


1.55 


1.10 


1.13 


0.95 


4.1 


EmailEnron 


10.80 


2.40 


0.90 


1.82 


0.86 


10.60 


2.92 


2.70 


2.82 


2.68 


10.70 


2.66 


1.80 


2.32 


1.77 


6.0 


EmailEuAll 


13.06 


2.14 


1.24 


1.64 


1.22 


12.52 


1.74 


1.52 


1.70 


1.24 


12.79 


1.94 


1.38 


1.67 


1.23 


10.4 


Gnutella 


10.32 


2.64 


1.58 


1.66 


1.38 


12.08 


2.18 


2.06 


2.12 


1.82 


11.20 


2.41 


1.82 


1.89 


1.60 


7.0 


Brightkite 


13.60 


1.56 


0.64 


1.32 


0.54 


13.64 


1.64 


1.32 


1.34 


1.32 


13.62 


1.60 


0.98 


1.33 


0.93 


14.6 


Gowalla 


108.20 


2.10 


1.12 


1.82 


0.91 


107.52 


1.74 


1.52 


1.64 


1.21 


107.86 


1.92 


1.32 


1.73 


1.06 


101.8 



is more effective than itself under the edge insertion case. 
Taking the Gnutella dataset as an example (row 13 in Table HHi. 
for the edge deletion case, algorithm X reduces the average 
deletion time over algorithm N by 143.75%, while for the edge 
insertion case, algorithm X cuts the average insertion time 
over algorithm N only by 5.8%. For the edge deletion case, 
algorithm Y reduces the average deletion time over algorithm 
N by 59%, while for the edge insertion case, algorithm Y 
reduces the average insertion time over algorithm by 2.8%. 

C. Results for a batch of edge updates 

In previous experiments, we have shown the performance of 
our algorithms for core maintenance in a graph given the graph 
is updated by an edge insertion or deletion. These algorithms 
are extremely useful to continuously monitor the dynamics 
of the core number of the nodes in time-evolving graph. 
Besides the graph with a single edge update, here we show 
the performance of our algorithms in a dynamic graph given 
a batch of edges updates. Assume that the graph has r edge 
updates at a time interval At. To maintain the core number 
of the nodes, we need to sequentially invoke our algorithm 
(algorithm XY) r times. For the baseline algorithm (algorithm 
B), however, we can invoke it one time to recompute the 
core number of all nodes. Since our XY algorithm is the best 
algorithm for single edge updates, we only compare our XY 
algorithm with algorithm B. 

Now, let us focus on the last column in Tablellllwhich shows 
the speedup ratio (SR) of algorithm XY over algorithm B for 
a single edge update. In general, if r is less than the speedup 
ratio, then our algorithm is more efficient than the baseline 
algorithm for processing a batch of edge updates at a time 
interval At. For example, in Gowalla dataset, the speedup ratio 
of our algorithm is 101.8. As a result, if the graph has less than 
101 edge updates, i.e., r < 101, then our algorithm is more 
efficient than the baseline algorithm. However, if r is larger 
than the speedup ratio of our algorithm, the baseline algorithm 
is more preferable than our algorithm. As shown in Table Ull 
the speedup ratio of our algorithm increases as the graph size 



5000 




Fig. 5. Speedup ratio vs. graph size. 



increases. This result implies that, for a batch of edge updates, 
our algorithm is very efficient in large graphs with small r. In 
other words, if the graph is very large and evolves slowly, then 
our algorithm is more preferable. However, if the graph is very 
small and frequently varying, then the baseline algorithm is 
more efficient than our algorithm. Below, we show the speedup 
ratio of our algorithm in large synthetic graphs. 

To evaluate the speedup ratio of our algorithm in large 
graphs, we generate five large synthetic graphs based on a 
power-law random graph model [6]. Specifically, we produce 
five synthetic graphs Gi, • • ■ ,0^ with Gi has i million nodes 
and 5 X i million edges for z = 1, • • • , 5. Then, we adopt the 
same method used in our previous experiments to compute the 
speedup ratio of our algorithm. Fig. |5] shows that the result of 
speedup ratio of our algorithm with different graph size. From 
Fig. |5] we can see that the speedup ratio is greater than 4700 
when the graph size is 5 million nodes and 25 million edges. 
That is to say, in such a graph, if r is smaller than 4700, then 
our algorithm is more efficient than the baseline algorithm. 
Generally, for a fixed graph size (from 1 million to 5 million 
nodes), if r is below the red curve in Fig.|5] then our algorithm 
is more preferable than the baseline algorithm, otherwise the 
baseUne algorithm is more efficient. 
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V. Related work 

The k-core decomposition in networks has been extensively 
studied in the literature. In [22], Seidman introduces the con- 
cept of fc-core for measuring the group cohesion in a network. 
The cohesion of the fc-core increases as k increases. Recently, 
the fc-core decomposition in graph has been successfully used 
in many application domains, such as visualization of large 
complex networks [7], [9], [4], [3], [25], uncovering the 
topological structure of the Internet [10], [5], [2], analysis of 
the structure and function of the biological networks [17], [1], 
[23], studying percolation in random graph [14], [15], as well 
as identifying the influential spreader in complex network [18]. 
Below, we list some notable work on these applications. 

In [7], Batagelj et al. propose to use fc-core decomposition 
to visualize the large graph. Specifically, they first partition 
a large graph into smaller parts using the fc-core decomposi- 
tion and then visualize each smaller part by standard graph 
visualization tools. In [9], based on the fc-core decomposition, 
Baur et al. present a method for drawing autonomous systems 
graph using 2.5D graph drawing. Their algorithm makes use of 
a spectral layout technique to place the nodes in the highest 
order core. Then, the algorithm uses an improved directed- 
forces method to place the nodes in each fc core according to 
the decreasing order Alvarez-Hamelin et al. [4], [3] propose 
a visualization algorithm to uncover the hierarchical structure 
of the network using fc-core decomposition. Their algorithm 
is based on the hierarchical property of fc-core decomposition. 
More recently, Zhang and Parthasarathy [25] introduce a 
different notion, namely triangle fc-core, to extract the clique- 
like structure and visualize the graph. Unlike the traditional 
fc-core, the triangle fc-core is the maximal subgraph that each 
edge of the subgraph is contained within at least fc triangles. 
They also propose a maintenance technique for triangle fc-core. 
Since the triangle fc-core is totally different from fc-core, their 
maintenance technique cannot be applied in our problem. The 
fc-core decomposition is also successfully used for analyzing 
and modeling the structure of the Internet [10], [5], [2]. For 
example, in [10], Carmi et al. study the problem of mapping 
the Internet using the method of fc-core decomposition. In [5], 
Alvarez-Hamelin et al. investigate the hierarchies and self- 
similarity of the Internet using fc-core decomposition. Besides 
the Internet, the fc-core decomposition has also been applied to 
analyze the structure and function of the biological networks. 
In [17], Kitsak et al. propose a method based on the notion of 
fc-core to find the molecular complexes in protein interaction 
networks. Altaf-Ul-Amin et al. [1] propose a technique for 
predicting the protein function based on fc-core decomposition. 
In [23], Wuchty and Almaas apply the fc-core decomposition to 
identify the layer structure of the protein interaction network. 
In addition, the fc-core decomposition is recently used to 
identify the influential spreaders in complex network [18]. 
In [18], Kitsak et al. find that the nodes located in the high 
order core are more likely to be a influential spreader. Another 
line of research is to investigate the fc-core percolation in a 
random graph [14], [15], [11]. These studies mainly focus on 



investigating the threshold phenomenon of the existence of a 
fc-core based on some specific random graph models. 

From an algorithmic point of view, Batagelj and Zaversnik 
propose an 0{n + m) algorithm for fc-core decomposition 
in general graphs [8]. Their algorithm recursively deletes the 
node with the lowest degree and uses the bin-sort algorithm 
to maintain the order of the nodes. However, this algorithm 
needs to randomly access the graph, thus it could be inefficient 
for the disk-resident graphs. To overcome this problem, in 
[12], Cheng et al. propose an efficient fc-core decomposition 
algorithm for the disk-resident graphs. Their algorithm works 
in a top-to-down manner that calculates the fc-cores from 
higher order to lower order To make the fc-core decomposition 
more scalable, in [21], Montresor et al. propose a distributed 
algorithm for fc-core decomposition by exploiting the locality 
property of fc-core. All the above mentioned algorithms are 
focus on fc-core decomposition in static graph except for [20]. 
For the dynamic graph, in [20], Miorandi and Pellegrini apply 
the 0{n + m) algorithm given in [8] to recompute the core 
number of the nodes when the graph is updated, which is 
clearly inefficient. In the present paper, we propose a more 
efficient core maintenance algorithm in dynamic graphs. Our 
algorithm are quite efficient, which is more than 100 times 
faster than the re-computation based algorithm. 

VI. Conclusions 

In this paper, we propose an efficient algorithm for main- 
taining the core number of nodes in dynamic graphs. For a 
node u, we define a notion of induced core subgraph G„, 
which contains the nodes that are reachable from u and have 
the same core number as u. Given a graph G and an edge 
(w, v), we find that only the core number of nodes in Gu or 
Gt, or Guvjv may need to be updated after inserting/deleing 
the edge (u, u). Based on this, first, we introduce a coloring 
algorithm to identify all of these nodes. Second, we devise 
a recoloring algorithm to determine the nodes whose core 
numbers definitely need to be updated. Finally, we update the 
core number of such nodes by a linear algorithm. In addition, 
we develop two pruning strategies, namely X-pruning and Y- 
pruning, to further accelerate the algorithm. We evaluate our 
algorithm over 15 real- world and 5 large synthetic datasets. 
The results demonstrate the efficiency of our algorithm. 
References 

[1] M. Altaf-Ul-Amin, K. Nishikata, T. Koma, T. Miyasato, Y. Shinbo, 
M. Arifuzzaman, C. Wada, M. Maeda, T. Oshima, H. Mori, and 
S. Kanaya. Prediction of protein functions based on k-cores of protein- 
protein interaction networks and amino acid sequences. Genome 
Informatics, 14, 2003. 

[2] J. L Alvarez-Hamelin, M. G. Beiro, and J. R. Busch. Understanding 
edge connectivity in the internet through core decomposition. Internet 
Mathematics, 7(l):45-66, 2011. 

[3] J. L Alvarez-Hamelin, L. Dall'Asta, A. Barrat, and A. Vespignani. k- 
core decomposition: a tool for the visualization of large scale networks. 
CoRR, abs/cs/0504107, 2005. 

[4] J. L Alvarez-HameHn, L. Dall'Asta, A. Barrat, and A. Vespignani. 
Large scale networks fingerprinting and visualization using the k-core 
decomposition. In NIPS, 2005. 

[5] J. I. Alvarez-HameUn, L. Dall'Asta, A. Ban'at, and A. Vespignani. K- 
core decomposition of internet graphs: hierarchies, self-similarity and 
measurement biases. NHM, 3(2):371-393, 2008. 



14 



[6] A.-L. Barabasi and R. Albert. Emergence of scaling in random networks. 
science, 1999. 

[7] V. Batagelj, A. Mrvar, and M. Zaversnik. Partitioning approach to 
visualization of large graphs. In Graph Drawing, pages 90-97, 1999. 

[8] V. Batagelj and M. Zaversnik. An o(m) algorithm for cores decompo- 
sition of networks. CoRR, cs.DS/03 10049, 2003. 

[9] M. Baur, U. Brandes, M. Gaertler, and D. Wagner. Drawing the as graph 
in 2.5 dimensions. In Graph Drawing, pages 43^8, 2004. 
[10] S. Carmi, S. Havlin, S. Kirkpatrick, Y. Shavitt, and E. Shir. A model of 
internet topology using k-shell decomposition. PNAS, 104(27):1 1 150- 
11154, 2007. 

[11] D. Cellai, A. Lawlor, K. A. Dawson, and J. P. Gleeson. Tricritical point 

in heterogeneous k-core percolation. Physical review letters, 107. 
[12] J. Cheng, Y. Ke, S. Chu, and M. T. Ozsu. Efficient core decomposition 

in massive networks. In ICDE, 2011. 
[13] J. Cohen. Trusses: Cohesive subgraphs for social network analysis. 

Technique report, 2005. 
[14] S. N. Dorogovtsev, A. V. Goltsev, and J. F. F. Mendes. k-core 

organization of complex networks. Phys. Rev. Lett., 96(4), 2006. 
[15] A. V. Goltsev, S. N. Dorogovtsev, and J. F. F. Mendes. k-core (bootstrap) 

percolation on complex networks: Critical phenomena and nonlocal 

effects. CoRR, abs/cond-mat/060261 1, 2006. 
[16] R. A. Hanneman and M. Riddle. Introduction to social network methods. 

Online book, 2005. 

[17] M. Kitsak, L. K. Gallos, S. Havhn, F Liljeros, L. Muchnik, H. E. 
Stanley, and H. A. Makse. An automated method for finding molecular 
complexes in large protein interaction networks. BMC Bioinformatics, 
4, 2003. 

[18] M. Kitsak, L. K. Gallos, S. Havlin, F Liljeros, L. Muchnik, H. E. 

Stanley, and H. A. Makse. Identification of influential spreaders in 

complex networks. Nature Physics, 6:888-893, 2010. 
[19] J. Leskovec. Standford network analysis project. 2010. 
[20] D. Miorandi and F. D. Pellegrini. K-shell decomposition for dynamic 

complex networks. In WiOpt, 2010. 
[21] A. Montresor, F. D. Pellegrini, and D. Miorandi. Distributed k-core 

decomposition. In PODC, pages 207-208, 2011. 
[22] S. B. Seidman. Network structure and minimum degree. Social networks, 

5(3):269-287, 1983. 
[23] S. Wuchty and E. Almaas. Peeling the yeast protein network. Pro- 

teomics, 5, 2005. 

[24] R. Zafarani and H. Liu. Social computing data repository at ASU, 2009. 
[25] Y. Zhang and S. Parthasarathy. Extracting analyzing and visualizing 
triangle k-core motifs within networks. In ICDE, 2012. 



